[core] respect `local_files_only=True` when using sharded checkpoints #12005

sayakpaul · 2025-07-28T14:48:50Z

What does this PR do?

See: #11948

This reverts commit 8d431dc.

HuggingFaceDocBuilderDev · 2025-07-28T15:00:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

DN6 · 2025-08-12T14:04:07Z

src/diffusers/utils/hub_utils.py


    ignore_patterns = ["*.json", "*.md"]
    # `model_info` call must guarded with the above condition.
-    model_files_info = model_info(pretrained_model_name_or_path, revision=revision, token=token)


So the purpose of this check is to verify if the necessary sharded files are present in the model repo before attempting a download, presumably to avoid a large download if all files aren't present. If we cannot connect to the hub, we just have to assume the necessary shard files are already present locally.

I think we can just skip this check if local_files_only=True and then check if all the shard filenames are present in the cached_folder

How about now?

I think just this is sufficient

if not local_files_only: # run model_info check

Run snapshot download

Then after the cached_filenames is created, iterate over the files to verify they exist

for filename in cached_filename: if not if not os.path.exists(filename): raise EnvironmentError("expected file not present in {cached_folder}")

We don't have to run snapshot_download() when local_files_only=False, that might be unnecessary.

Why run snapshot_download() after also running model_info()?

Even if we run snapshot_download() regardless of local_files_only var, I think we should have it inside try-except in case the endpoint cannot be pinged for some reason and raise the ConnectionError as before.

See if b7af511 resolves this.

Ah I see what you mean. Let me update. Sorry about the back and forth.

sayakpaul · 2025-08-13T09:39:57Z

@DN6 see if the latest changes work for you.

DN6 · 2025-08-14T03:13:15Z

src/diffusers/utils/hub_utils.py

            cached_folder = os.path.join(cached_folder, subfolder)

+        # Check again after downloading/loading from the cache.
+        model_files_info = _get_filepaths_for_folder(cached_folder)


We don't need a new function to walk over the cached folder. If you just iterate over cached_filenames and check if the file exits. Avoid looping over files multiple times this way.

for cached_file in cached_filenames: if not os.path.exists(cached_file): raise EnvironmentError(f"{cached_file} not found in {cached_folder which is required..."

sayakpaul added 3 commits July 28, 2025 13:27

tighten compilation tests for quantization

8d431dc

feat: model_info but local.

69920ef

up

d5c1772

sayakpaul requested review from DN6 and yiyixuxu July 28, 2025 14:48

Revert "tighten compilation tests for quantization"

f38a644

This reverts commit 8d431dc.

sayakpaul mentioned this pull request Jul 28, 2025

Impossible to load WanTransformer3DModel when offline using the 'from_pretrained' function. #11948

Open

sayakpaul added 3 commits July 29, 2025 13:58

Merge branch 'main' into local-model-info

2d993b7

Merge branch 'main' into local-model-info

85279df

Merge branch 'main' into local-model-info

d117474

DN6 reviewed Aug 12, 2025

View reviewed changes

sayakpaul added 2 commits August 12, 2025 20:20

Merge branch 'main' into local-model-info

71843a0

up

fb2397f

sayakpaul requested a review from DN6 August 12, 2025 14:57

Merge branch 'main' into local-model-info

832de66

sayakpaul mentioned this pull request Aug 13, 2025

_get_checkpoint_shard_files offline mode fix #12132

Closed

4 tasks

sayakpaul added 5 commits August 13, 2025 14:16

Merge branch 'main' into local-model-info

01784c3

reviewer feedback.

b7af511

reviewer feedback.

04cd2dc

up

1c528a4

up

1b939e5

sayakpaul added 2 commits August 13, 2025 20:46

empty

2a9734f

Merge branch 'main' into local-model-info

09e063c

DN6 reviewed Aug 14, 2025

View reviewed changes

update

7fd1a82

DN6 approved these changes Aug 14, 2025

View reviewed changes

DN6 merged commit 1b48db4 into main Aug 14, 2025
32 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[core] respect `local_files_only=True` when using sharded checkpoints #12005

[core] respect `local_files_only=True` when using sharded checkpoints #12005

Uh oh!

sayakpaul commented Jul 28, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jul 28, 2025

Uh oh!

DN6 Aug 12, 2025

Uh oh!

sayakpaul Aug 12, 2025

Uh oh!

DN6 Aug 13, 2025 •

edited

Loading

Uh oh!

sayakpaul Aug 13, 2025 •

edited

Loading

Uh oh!

sayakpaul Aug 13, 2025

Uh oh!

sayakpaul Aug 13, 2025 •

edited

Loading

Uh oh!

sayakpaul commented Aug 13, 2025

Uh oh!

DN6 Aug 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[core] respect local_files_only=True when using sharded checkpoints #12005

[core] respect local_files_only=True when using sharded checkpoints #12005

Uh oh!

Conversation

sayakpaul commented Jul 28, 2025

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Jul 28, 2025

Uh oh!

DN6 Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

sayakpaul Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

DN6 Aug 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sayakpaul Aug 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sayakpaul Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

sayakpaul Aug 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sayakpaul commented Aug 13, 2025

Uh oh!

DN6 Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[core] respect `local_files_only=True` when using sharded checkpoints #12005

[core] respect `local_files_only=True` when using sharded checkpoints #12005

DN6 Aug 13, 2025 •

edited

Loading

sayakpaul Aug 13, 2025 •

edited

Loading

sayakpaul Aug 13, 2025 •

edited

Loading